[in]appropriate Use of Statistical Measures in [the Name Of] Balancing Data Quality and Confidentiality of Tabular Format Magnitude Data

Author

  • Ramesh A. Dandekar
Abstract

Statisticians are aware that measures such as the mean, the variance, and the Pearson correlation coefficient are disproportionately influenced by a relatively small number of extremely large observations and are therefore unreliable for comparing the overall quality of data with an extremely skewed distribution. Tabular data cells follow such a distribution. In this paper we show that linear-programming-based controlled tabular adjustment (CTA), which generates synthetic tabular data (Dandekar2001), makes use of a least absolute difference linear regression model and is well suited to controlling overall data quality on its own, without the additional steps proposed by quality-preserving controlled tabular adjustment (QP-CTA), which has been heavily promoted to the statistical community since 2003.
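The sensitivity of the Pearson correlation coefficient to a few dominant cells can be illustrated with a minimal sketch. The cell values, the 40% adjustment, and the helper name `pearson` below are hypothetical and chosen only for illustration; they are not the paper's data or method.

```python
from statistics import mean, median


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


# Hypothetical skewed table: 98 small cells plus 2 dominant cells.
original = [float(10 + i) for i in range(98)] + [50_000.0, 120_000.0]

# Adjust every small cell by +40% or -40% (alternating);
# leave the two dominant cells unchanged.
adjusted = [
    v * (1.4 if i % 2 == 0 else 0.6) for i, v in enumerate(original[:98])
] + original[98:]

# Correlation over all cells is driven almost entirely by the two large cells.
r_all = pearson(original, adjusted)

# Correlation over the small cells alone, where all the changes were made.
r_small = pearson(original[:98], adjusted[:98])

# A robust, per-cell view of the distortion.
median_rel_change = median(abs(a - o) / o for o, a in zip(original, adjusted))

print(f"Pearson r, all cells:      {r_all:.6f}")
print(f"Pearson r, small cells:    {r_small:.3f}")
print(f"Median relative change:    {median_rel_change:.2f}")
```

In this sketch the correlation over all cells stays very close to 1 even though 98 of the 100 cells were changed by 40%, while the correlation restricted to the small cells and the median relative change both reveal the distortion.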


Similar resources

United Nations Statistical Commission and Economic Commission for Europe, Conference of European Statisticians; European Commission, Statistical Office of the European Communities (Eurostat); Joint ECE/Eurostat Work Session on Statistical Data Confidentiality: Balancing Data Quality and Confidentiality for Tabular Data (Invited Paper)

1. Tabular data are the earliest form of official statistics data products and remain a staple of them. Familiar examples of tabular data products in official statistics include count data such as age-race-sex and other demographic data, concentration (or percentage) data such as financial or energy utilization statistics, and magnitude data such as total retail sales or air pollution data. Confidentia...


Mathematical Programming Models for Balancing Data Quality and Confidentiality in Tabular Data

1. Mathematical Programming Model for Controlled Tabular Adjustment (CTA). Statistical agencies use different methods to protect the confidentiality of tabular data. The most widely used method, complementary cell suppression, suppresses both primary (sensitive) and secondary (non-sensitive) cells to assure confidentiality. Despite its popularity, it suffers from severe limitations. Complementar...


Two-Dimensional Compact Inversion of Magnetic Data in the Presence of Remanent Magnetization

Remanent magnetization causes a change in the direction and intensity of the magnetization vector. If inversion is performed without regard to remanence, it may in some cases yield unreliable and misleading results. Several solutions have been proposed so far for inversion in the presence of remanent magnetization, one of which is to convert the data of total magnetic field into data that is independent ...


The Relationship Between Depression and Quality of Life in Patients with Rheumatoid Arthritis

  Background & Objective: Rheumatoid arthritis is one of the most common chronic diseases of unknown cause and leads to significant disability, especially in adult patients. Depression can considerably affect patients' quality of life. Materials & Methods: In this study, 190 patients who had been referred to rheumatology and internal wards of Tehran hospitals were randomly select...


Evaluating the Use of Electronic Personal Information Management Components by Faculty Members

Background and Aim: The aim of this study is to assess electronic personal information management among Iranian faculty members in knowledge and information science and in medical library and information sciences, based on the Jones model. Method: This is an applied study that is descriptive and analytical in terms of data collection. The statistical population included faculty...



Publication date: 2012